Estimating Expected Calibration Errors

نویسندگان

چکیده

Uncertainty in probabilistic classifiers predictions is a key concern when models are used to support human decision making, broader pipelines or sensitive automatic decisions have be taken. Studies shown that most not intrinsically well calibrated, meaning their scores consistent with posterior probabilities. Hence being able calibrate these models, enforce calibration while learning them, has regained interest recent literature. In this context, properly assessing paramount quantify new contributions tackling calibration. However, there room for improvement commonly metrics and evaluation of could benefit from deeper analyses. Thus paper focuses on the empirical context classification. More specifically it evaluates different estimators Expected Calibration Error (ECE), amongst which legacy some novel ones, proposed paper. We build an procedure quality ECE estimators, use decide estimator should practice settings.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Calibration results for rank−dependent expected utility

If its utility function is everywhere increasing and concave, rank−dependent expected utility shares a troubling property with expected utility −− aversion to the same moderate−stakes risk at every wealth level implies an extreme aversion to large−stakes risks. In fact, the problem may be even worse for rank−dependent expected utility, since the moderate−stakes risk need not be actuarially fair...

متن کامل

Calibration without reduction for non-expected utility

Evidence from the lab and the field shows that most people exhibit substantial risk aversion over stakes of hundreds of dollars. Expected utility cannot capture nonnegligible risk aversion over such small stakes without producing implausible risk aversion over large stakes, and under the reduction of compound lotteries axiom, neither can nonexpected utility preferences. Motivated by experimenta...

متن کامل

Calibration Results for Non-Expected Utility Theories∗

Rabin [23] proved that a low level of risk aversion with respect to small gambles leads to a high level of risk aversion with respect to large gambles. Rabin’s arguments strongly depend on expected utility theory, but we show in this paper that similar arguments apply to many non expected utility theories, and to a certain extent, to theories dealing with uncertainty as well.

متن کامل

Estimating Errors in Concept Selection

Numerical concept selection methods are used throughout industry to determine which among several design alternatives should be further developed. The results, however, are rarely believed at face value. Uncertainties (or errors) in subjective choices, modeling assumptions, and measurement errors are fundamental causes of this disbelief. This paper describes a methodology developed to predict o...

متن کامل

Estimating Maximum Expected Value through Gaussian Approximation

Theorem 2. If we compare the expected value of DE reported in Equation (4) with the value of the estimator WE in Equation (3), we can notice strong similarities. The main difference is that in DE the sample mean of variable Xi and its probability of being the maximum are computed w.r.t. two independent set of samples, while in WE these two quantities are positively correlated. It follows that W...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2021

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-030-86380-7_12